ABSTRACT

A system for experimenting with semantic representations
is developed. The existing REL System is discussed and
comparisons are made. The proposed system, written in
LISP, is described. A method for extending the user's
language is developed. Recommendations for further study
are presented. Program listings and sample results are
included in the appendices.
I. INTRODUCTION


Question answering systems provide interaction between 
humans and computers in a natural language. A question 
answering system will accept queries to a data base and make 
replies in a language closely approximating natural English. 
The main components of such a system are a syntax for the
query language, a data base, and some executive functions.

Current question answering systems are limited in their 
linguistic and semantic capabilities. Even the most success- 
ful systems will not allow the use of extremely elaborate 
sentence construction or obscure or poetic phraseology. 

The forms in which data is represented in the data base are
also restricted. In successful systems data is represented
by the equivalent of lists and tables, and the semantic
functions generally are composed of various counting and
[word illegible] operations. Other types of semantic
functions, such as cause and effect and inductive reasoning
operations, are not available.

The present question answering systems appear to have 
progressed significantly toward solving purely syntactic 
problems. However, in the area of semantic problems such as 
reasoning and data representation, these same question 
answering systems appear to work at a very primitive level. 

There exists a need for a system which would allow
researchers to experiment with the semantic representation
of data. This report describes one such system, the Skeletal
Semantic Experimentation System (SSE). The SSE system is
so called because it provides a skeleton system (syntax,
parser, and a partial executive system) to which the user
must add desired semantic procedures and a data base.

Such a system as SSE would be required to have the
following characteristics:
a) It must have an English syntax. The syntax must
represent a major portion of natural English and be
comprehensive enough to contend with problems which are
purely syntactic in nature. For example, the system must
recognize that "A BOY" is an acceptable phrase while "A
BOYS" is not.

The syntax must have a minimum of semantic preconceptions.
It is not desired that the system have defined
meanings for words. For example, the system should accept
the phrase

"DOGS BAY"

without making reference to a four legged animal.

b) The system should use a parser which is compatible
with the syntax and is reasonably flexible and efficient.

c) The system must be transportable. It must be machine
independent, relieving the experimenter of the task of
modifying the system for his configuration.

In the past, for a researcher to experiment with data
representations, he has had to expend considerable time and
effort redoing work that had been previously accomplished.
Rather than concentrate his efforts purely on the data
representation methods, a researcher has had to generate a
syntax to represent natural English as well as a parser and
control programs. The purpose of this project was to supply
the researcher with a transportable system, written in
LISP 1.5 [2], that would remove the requirement to generate
a new syntax and parser. The system was developed by combining
the well documented English syntax of the Rapidly Extensible
Language System of B. H. Dostert and F. B. Thompson [3,4,5]
with the General Purpose Parser of E. W. Merriam [7].

Dostert and Thompson's REL system, which has a parser
built into it, is written in assembly language and is
therefore totally dependent upon the type of machine in
which it resides. There are two choices open to an
individual wishing to use a portion or all of the REL
system. He may use the entire system on a machine with
which it is compatible, or he may reprogram the desired
portions of the REL system. The second alternative was
chosen.

It is realized that programming in assembly language may 
be more efficient and when one is developing a production 
system this fact must be considered. However, the goal of 
SSE is not production but rather a transportable system that 
can be used by researchers at different computer facilities. 
For this purpose a higher level language is more appropriate. 

SSE takes a sentence, parses it according to the REL
English syntax, and returns the sentence in a diagrammed
format. The returned sentence is then in a form that the
researcher can use. He can apply his semantic procedures,
or he may wish to extract information from the sentence
and store it for further experimentation.
A. REL ENGLISH - A GENERAL LANGUAGE

As a general language, REL English encompasses a
considerable segment of natural English. In addition, REL
English has the capability of performing elementary
mathematical functions. However, REL English will not
accept all of the grammatical constructions of ordinary
English. Colloquial and extremely elaborate constructions
are not part of REL English.

1. REL English Vocabulary

The vocabulary for REL English is limited to "little"
words such as "who", "which", "and", and "before". These
"little" words are used in the definitions of the REL English
syntax. The user then adds his own vocabulary to the system.
2. REL English Grammar

At present the grammar for REL English contains in
excess of 350 rules [3] which are based on the centrality of
the verb. The verb is considered to be the fulcrum of the
sentence, and accumulates noun phrase modifiers, adverbial
modifiers and auxiliaries about it. The parts of speech for
REL English are as follows:

PARTS OF SPEECH:

NP           noun phrase        John, boy, locations of boys
VP           verb phrase        give, will give, give books
CV           copula verb        (i.e., be and have verbs)
NU           number
AT           adverb of time     before June 1967, next year
CP           comparative        greater than, equal to
[illegible]  time displacement  3 days, June
DI           digit              3
DS           digit string       345
LO           locative           in Boston, where John lived [5].

Using the syntax for REL English, a considerable
variety of natural English constructions can be interpreted.
These constructions include complex verb structures, relative
clauses, complex noun phrases, and conjunctions (both noun
and sentence).

3. Features

The concept of REL English which results in
the successful evaluation of a syntactic structure, be it
sentence or phrase, is the concept of "features." The idea
of features is not a new one. It has been used before to
subcategorize parts of speech, to identify, for example, a
noun not just as a noun but as a plural noun or a possessive
noun. The concept of features was implicit in a paper by
G. N. Harman [6] and was discussed by Chomsky in Aspects
of the Theory of Syntax [1].

The use of features in REL English results in a
hierarchical structure of syntactic requirements. This
ordering of syntactic requirements helps to control syntactic
ambiguity and prevent ungrammatical constructions. For
example, "some the boy" would not be accepted because
quantifiers cannot be combined with a noun which has had
the "definite" feature set due to the presence of the
definite article.

Each part of speech is assigned a class of features
which can be used with that part of speech. During the parsing
of a sentence, the feature list for a given word or phrase
will be compared with the features required by the grammar
rule in question. If the features match then the rule can
be applied. Appendix 1 lists the features for each part of
speech with an example for each feature.

Features are also used to determine the best syntactic
grouping when several groupings are acceptable.
Dostert and Thompson find that grouping best which minimizes
the number of accesses to the data base. For example,
consider the phrase:

John's son who lives in Boston.

Of the two possible combinations:

a) John's (son who lives in Boston)

b) (John's son) who lives in Boston

(b) would be the best grouping since John is likely to have
fewer sons than there are sons who live in Boston [4]. The
number of accesses to the data base that the system makes
is an important consideration for the REL system and other
production question answering systems, since excessive
accesses to a disk file can greatly increase response time.
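
The selection of a grouping by data base cost can be
illustrated with a short sketch. It is written in Python
rather than LISP 1.5, purely for illustration, and is not the
REL mechanism itself: the access counts and the function name
best_grouping are hypothetical, and the text does not say how
REL estimates these costs.

```python
# Hypothetical estimates of how many data-base records each
# grouping would touch. The numbers are invented for illustration.
estimated_accesses = {
    "(John's son) who lives in Boston": 3,     # John's few sons, then filter
    "John's (son who lives in Boston)": 5000,  # every son in Boston, then filter
}

def best_grouping(candidates):
    """Return the grouping with the smallest estimated access count."""
    return min(candidates, key=candidates.get)

print(best_grouping(estimated_accesses))
```

Under these assumed counts the sketch selects grouping (b),
in agreement with the argument given above.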
B. ADVANTAGES OF THE REL ENGLISH SYNTAX

The REL English syntax is well defined and documented.
Sample sessions provided by Dostert and Thompson [5] show
that the REL system works well, even with complicated sentence
structures. REL English appears virtually free of semantic
assumptions with the following exception: In defining a
noun, REL English allows for three cases:

a) a name, such as John or book;
b) a class, such as girl;
c) a relation, such as sister or location.

In a natural language system certain purely syntactic
problems develop regardless of the type of data
representation being used. The example of the distinction
between

A BOY and A BOYS

is a case in point. REL English with its concept of features
does a most adequate job of handling these syntax problems.

But the most important advantage of the REL English
syntax is that its documentation allows one to take advantage
of the work that has been done and continue experimentation.


III. THE PARSER

To provide the required parsing operation, a slightly
modified version of E. W. Merriam's General Purpose Parser
was used [7]. The parser, written in LISP 1.5, takes a list
of symbols, where each symbol stands for a word or phrase,
and a list of grammar rules to be used in parsing the string.
A symbol may be an atomic symbol, in which case it is a part
of speech, or it may be a list structure. If it is a list,
then the CAR (first element) of the list will be used as the
part of speech by the parser.

In Merriam's parser a separate copy of the parsing rules
is passed with each input string. In SSE the syntax rules
are returned to "PARSE" as the result of a call to the
function "RULES". In both versions the second argument for
"PARSE" is a list of atomic symbols representing the sentence
to be parsed.

A. RULES OF GRAMMAR

As in Merriam's version of the parser, the grammar rules
consist of two parts. There is a left and a right side
separated by a dot. Each of the grammar rules takes the
form:

(( categories ) . f-name ).
1. Categories

The left side of a grammar rule is a list of categories.
A category is an atomic symbol that will be compared with the
original input string and future strings generated by the
parsing procedure. If, during a comparison, a group of
symbols in the input string match, by number and position,
the symbols in the list of categories, then the function
whose name is the right side of the rule is called. A list
of the symbols that were matched in the input string is
passed to the called function as an argument.


Consider the input string

( T H E B O Y S ).

Later in this report the method by which a word such as BOY
can be defined to the system will be shown. Suffice it for
now to say that BOY would be recognized as a noun phrase and
the input string would look like

( T H E (NP NAME BOY) S ).

Recall at this point that since the input string contains
a list, the CAR of that list, "NP," will be used when compared
with the grammar rule

(( NP S ) . PLURAL ).

In the grammar rule the list of categories is ( NP S ).
If the input string contains the atom "NP" followed by the
atom "S", then the function "PLURAL" will be called. The
list ( (NP NAME BOY) S ), from the input string, will be
passed to "PLURAL" as an argument.

The value returned to the parser by the called function
must be in the form of a list. For example, the purpose of the
function "PLURAL" is to build a new noun phrase which is the
plural of the noun phrase in the argument. It must also
turn on the plural feature and return the new noun phrase.
Therefore, PLURAL would return

( (NP NAME BOYS PLF +) )

to the parser. This list will then be used for further
parsing of the input string which now has the form

( T H E (NP NAME BOYS PLF +) ).
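
The matching and replacement step just described can be
sketched as follows. The sketch is in Python rather than
LISP 1.5 and is only an illustration under stated
assumptions: the names category_of, plural and apply_rule
are hypothetical, Python lists stand in for LISP lists, and
only the first matching position is rewritten, whereas the
actual parser may pursue several alternatives.

```python
def category_of(symbol):
    """An atom is its own category; for a list, the first element (CAR) is used."""
    return symbol[0] if isinstance(symbol, list) else symbol

def plural(matched):
    """Build the plural noun phrase from [(NP NAME x), S]; returns a list of symbols."""
    phrase, _s = matched
    return [["NP", "NAME", phrase[2] + "S", "PLF", "+"]]

def apply_rule(string, categories, func):
    """Rewrite the first span whose categories match; return None if none match."""
    n = len(categories)
    for i in range(len(string) - n + 1):
        window = string[i:i + n]
        if [category_of(s) for s in window] == categories:
            result = func(window)
            if result is not None:        # None plays the role of NIL
                return string[:i] + result + string[i + n:]
    return None

string = ["T", "H", "E", ["NP", "NAME", "BOY"], "S"]
new_string = apply_rule(string, ["NP", "S"], plural)
print(new_string)  # ['T', 'H', 'E', ['NP', 'NAME', 'BOYS', 'PLF', '+']]
```

The called function returns a list of symbols, so that the
result can be spliced directly into the input string in place
of the symbols that were matched.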


2. Use of "ANY"

One of the significant features of Merriam's parser
is the use of the word "ANY". If the word "ANY" appears in
the list of categories, then any symbol occupying the
corresponding position in the input string will be accepted.
For example, both strings

1) ( J O H N :=NAME )

2) ( M A R Y :=NAME )

would result in a call to the function "P1" when compared
with the rule

(( ANY :=NAME ) . P1 ).
As will be seen, this feature plays a major part in the
user's ability to define new words to the existing system.
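
A minimal sketch of matching with "ANY" is given below, in
Python rather than LISP 1.5. The function name matches is
hypothetical, and the sketch compares only atomic symbols; it
illustrates the wildcard idea and is not Merriam's code.

```python
ANY = "ANY"

def matches(categories, symbols):
    """True when symbols match categories position by position; ANY accepts anything."""
    return len(categories) == len(symbols) and all(
        cat == ANY or cat == sym for cat, sym in zip(categories, symbols))

rule_categories = [ANY, ":=NAME"]

# The last two symbols of ( J O H N :=NAME ) and ( M A R Y :=NAME ).
print(matches(rule_categories, ["N", ":=NAME"]))  # True
print(matches(rule_categories, ["Y", ":=NAME"]))  # True
print(matches(rule_categories, ["N", ":=VERB"]))  # False
```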


B. COMPATIBILITY WITH REL ENGLISH SYNTAX

To make the Merriam parser and the REL English syntax
compatible, little modification was required. It was
necessary to make the REL English parts of speech atoms.
A phrase is built of the CONS of the atomic part of speech
and the description list for that phrase. For example,
the word "DOG" would be represented as a list, the CAR of
which would be the atom "NP" for noun phrase. The description
list of the phrase would contain the attribute "NAME" and its
value "DOG":

( NP NAME DOG )


IV. THE SSE SYSTEM

The Skeletal Semantic Experimentation System is a
program which couples the modified version of the Merriam
parser to the syntax of REL English. SSE was programmed in
LISP for several reasons. First, LISP is a widely used
programming language among researchers in artificial
intelligence. Second, Merriam's parser is written in LISP
and the REL English was in such a form that it could be
implemented in LISP with relative ease. However, the primary
consideration for using LISP was portability.

Because SSE uses no non-standard LISP functions, it
should be compatible with any computer system having a LISP
1.5 compiler or interpreter.


A. IMPLEMENTATION OF FEATURES

In the Dostert and Thompson system the features of each
part of speech are represented by a series of sixteen bits.
The Waterloo LISP system does not allow bit manipulation,
so the features and their values are placed on the
description list of the phrase. The symbol for a feature is
the three letter name of the feature listed in Appendix 1.
Its value is either "+" if the feature is set or "-" if it
is not. Since all parts of speech will not require the
maximum of sixteen possible features in all cases, it was
decided that initially a part of speech would contain no
features on its description list. In this manner it was
implied that all possible features were set to "-" (off).
This use of implied feature values is taken into account
in the "CHK" function.


B. THE "CHK" FUNCTION

The process of checking a given part of speech to ensure
that certain specified features have the desired values is
performed by the function "CHK". The arguments for the
function "CHK" are the phrase in question and a list of
features, with their corresponding values, that are to be
checked. The "CHK" function then looks on the description
list of the phrase for each feature in the given list. If
the result of a comparison is true then the next feature is
checked. If the result is false, then the function will
return "NIL", indicating that the features did not match.
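
The behavior described for "CHK" can be sketched as follows.
This is a Python illustration, not the LISP 1.5 listing of
Appendix 2: the names get_prop and chk are hypothetical, a
phrase is modeled as a Python list whose first element is the
part of speech, and False stands in for NIL. A feature that is
absent from the description list is treated as "-", following
the convention of implied values given above.

```python
def get_prop(phrase, attribute, default=None):
    """Look up an attribute on the description list (everything after the CAR)."""
    desc = phrase[1:]
    for i in range(0, len(desc) - 1, 2):
        if desc[i] == attribute:
            return desc[i + 1]
    return default

def chk(phrase, required):
    """True if every (feature, value) pair holds; a missing feature counts as "-"."""
    for feature, value in required:
        if get_prop(phrase, feature, "-") != value:
            return False          # plays the role of NIL
    return True

boys = ["NP", "NAME", "BOYS", "PLF", "+"]
print(chk(boys, [("PLF", "+"), ("POF", "-")]))  # True: POF is implied "-"
print(chk(boys, [("POF", "+")]))                # False
```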


C. THE "CHG" FUNCTION

Assuming that the result of the "CHK" function is true,
the new part of speech to be returned to the parser may
require that certain features be either added or modified.
Modifying the description list is the role of "CHG". "CHG"
accepts as arguments a phrase and a list of features which
are either to be added to its description list, or whose
values are to be changed from their present value.
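
A corresponding sketch of "CHG" is given below, again in
Python and under the same modeling assumptions (a phrase is a
list whose first element is the part of speech). The name chg
is hypothetical, and returning a modified copy rather than
altering the phrase in place is a choice made for the sketch;
the text does not say which the LISP function does.

```python
def chg(phrase, changes):
    """Return a copy of the phrase with each (feature, value) added or updated."""
    new_phrase = list(phrase)
    for feature, value in changes:
        # Attribute names sit at odd positions, after the part of speech.
        for i in range(1, len(new_phrase) - 1, 2):
            if new_phrase[i] == feature:
                new_phrase[i + 1] = value   # feature present: change its value
                break
        else:
            new_phrase.extend([feature, value])  # feature absent: add it
    return new_phrase

boy = ["NP", "NAME", "BOY", "PLF", "-"]
print(chg(boy, [("PLF", "+"), ("POF", "-")]))
# ['NP', 'NAME', 'BOY', 'PLF', '+', 'POF', '-']
```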


D. DEFINING WORDS

Words can be added to the user's lexicon as one of
four types:

a) NAME      Mary, dog, blonde
b) CLASS     girl
c) RELATION  location, mother
d) VERB      enroll, arrive

The first three types result in a noun phrase being formed.
The last type returns a verb phrase. A word is added by
calling the parser using the format

( DEF : W O R D :=type )

where "type" is one of the four types listed above. To
understand the procedure for defining words it is necessary
to understand several of the parsing rules which use the
word "ANY". The rules

((ANY :=NAME) . NME)
((ANY :=CLASS) . NME)
((ANY :=RELATION) . REL)
((ANY :=VERB) . VRB)

produce very similar results so that consideration of one
of the rules will be sufficient to describe all four.
Consider then the rule

((ANY :=NAME) . NME)

and an example definition

( DEF : J O H N :=NAME ).
When called, the function "NME" receives as its argument a
list consisting of two elements. The first call to "NME" would
result in the list ( N :=NAME ) being passed as the argument.
First a check is made to ensure that "N" is not a
part of speech. If it were, then "NME" would return NIL and
ignore the part of speech "N". However, since N is not a
part of speech, the assumption is made that "N" is part of a
word that is being defined. The list ( N ) is then appended
to the value of "NAME" on the description list of the
atom ":=NAME" and the list (( :=NAME NAME ( N ) )) is returned
to the parser. At this point the example definition would
be

( DEF : J O H ( :=NAME NAME ( N ) ))

and "NME" would be called again.

The arguments for this call would be ( H ( :=NAME NAME
( N ) )). When the list ( H ) was appended to the value of
"NAME" on the description list, the new value would be the
list ( H N ). This process of collecting letters would
continue until the definition example had the form

( DEF : ( :=NAME NAME ( J O H N ) )).

With the next call to "NME" the first atom in the list of
arguments will be ":" and a different procedure will be
followed. At this point it is assumed that all the letters of
the new word have been collected. Therefore the atom "N1" is
CONSed to the list ( :=NAME NAME ( J O H N ) ) forming the
dotted pair

( N1 . ( :=NAME NAME ( J O H N ) ))

which is returned to the parser.

It is at this point that "NME", "REL", and "VRB" differ.
"REL" and "VRB" would return "R1" and "V1" respectively in
place of "N1".

The example definition now has the form

( DEF ( N1 . ( :=NAME NAME ( J O H N ) ))).

One must now consider the final parsing rule which uses
the word "ANY":

(( DEF ANY ) . DEF ).

When called, the function DEF is passed a list containing
two elements. The first will always be "DEF". However, the
second element could be either ":" or the returned value from
"NME", "REL" or "VRB". If the second element is ":" then
it is assumed that all the letters of the word being defined
have not been collected. Therefore, the function "DEF"
returns NIL to the parser. If the second element is not
":" then it must be either ( N1 . ( :=NAME NAME ( J O H N ) ))
or one of its variations as noted above.

The function "DEF" sets the variable FN to N1 (which
is the name of a predefined function in the system). The
value of "NAME" on the description list of ":=NAME" is
extracted. The two items are CONSed to form a new parsing
rule

(( J O H N ) . N1 ).

Using the NCONC function of LISP the new parsing rule is
added to the list of parsing rules returned by the function
"RULES". At this point the word "J O H N" has been defined
to the system and is available to all future calls to the
parser.
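
The net effect of the definition procedure can be sketched
as follows. The sketch is in Python and compresses the
repeated calls to "NME" and the final call to "DEF" into one
hypothetical function, define_word; it is an illustration of
the outcome, not a transcription of the LISP functions. The
CLASS type is omitted because the text does not state which
function name is returned for it.

```python
# Function name placed on the right side of the new rule, per word type.
FUNC_FOR_TYPE = {":=NAME": "N1", ":=RELATION": "R1", ":=VERB": "V1"}

def define_word(string):
    """Fold ( DEF : letters... :=type ) into a rule (letters, function-name)."""
    if len(string) < 4 or string[0] != "DEF" or string[1] != ":":
        return None
    type_marker = string[-1]
    if type_marker not in FUNC_FOR_TYPE:
        return None
    letters = []
    # Letters are collected from right to left, one per rule application:
    # (N) -> (H N) -> (O H N) -> (J O H N)
    for symbol in reversed(string[2:-1]):
        letters = [symbol] + letters
    return (letters, FUNC_FOR_TYPE[type_marker])

rules = []
rules.append(define_word(["DEF", ":", "J", "O", "H", "N", ":=NAME"]))
print(rules)  # [(['J', 'O', 'H', 'N'], 'N1')]
```

Appending to the Python list rules corresponds to the NCONC
step, which makes the new rule available to later parses.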


E. ADDRULE

As a researcher experiments with various forms of data
and various syntactic constructions, he may find it necessary
to add his own syntactic rules to the system. The researcher
may encounter a situation not previously covered, or he may
wish to modify the order in which various syntactic groupings
are being considered. For whatever reason the experimenter
wishes to add a new syntactic rule, "ADDRULE" provides an
effective method of so doing.

"ADDRULE" accepts as an argument a single list of the
form

(( parser categories => part of speech to be returned )
 ( CHECK: features and values that must be true )
 ( SET: features and values that are to be set for
        the returned part of speech )
 ( name of semantic operator to be called )).

An example would be

ADDRULE ((
( CV NP => VP )
( CHECK: (FHV -) * (POF -)(RLF -) )
( SET: 1 (FOJ +)(FPH +) )
( SEMNOP ) )).

(This format is a literal translation of the format used
in Reference 5.)

For this rule to apply, the part of speech, copula verb,
must have the FHV feature turned off and the noun phrase
must have the POF and RLF features off. The symbol "*"
distinguishes which features are to apply to which part of
speech. In the above example, had the "*" been missing,
then "ADDRULE" would have assumed that all three features
listed would apply to the first part of speech listed
(i.e., CV).

The number following "SET:" indicates the "number" of
the part of speech whose feature list is to be copied onto
the description list of the returning value. In the above
example the "1" indicates that a copy of the feature list
of CV is to be placed on the description list of "VP".
Had the number been "0", then the features of neither of the
input parts of speech (i.e., CV or NP) would have been copied
to "VP". Thus by implication, all the features on "VP"
would have been off.

The list of features following the number are those
features which are to be changed on the description list of
the returned value. Thus from the above example it can be
seen that "VP" will have at least the FOJ and FPH features
turned on and the FHV feature turned off (since it was off
on CV).

"SEMNOP" is the name of a function which performs no
operations. This indicates that there are no semantic
operations required for this particular grammar rule.
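
The way such a specification might be turned into a parsing
rule is sketched below. This is a Python illustration and not
the "ADDRULE" listing of Appendix 2: the names features_of and
add_rule are hypothetical, the specification is written as
Python lists and tuples, and the handling of "NAME" and of the
semantic operator is left out.

```python
def features_of(phrase):
    """Description list of a phrase as a dict, leaving out the NAME attribute."""
    desc = phrase[1:]
    pairs = dict(zip(desc[0::2], desc[1::2]))
    pairs.pop("NAME", None)
    return pairs

def add_rule(spec):
    """Turn an ADDRULE-style specification into (categories, function)."""
    head, check, setting, _semantic = spec
    arrow = head.index("=>")
    categories, result_pos = head[:arrow], head[arrow + 1]
    # Split the CHECK: list on "*": group i applies to input part of speech i.
    groups, current = [], []
    for item in check[1:]:
        if item == "*":
            groups.append(current)
            current = []
        else:
            current.append(item)
    groups.append(current)
    copy_from, changes = setting[1], setting[2:]

    def rule(matched):
        for phrase, required in zip(matched, groups):
            feats = features_of(phrase)
            if any(feats.get(f, "-") != v for f, v in required):
                return None                  # rule does not apply (NIL)
        # Copy the features of the numbered part of speech; 0 copies nothing.
        feats = features_of(matched[copy_from - 1]) if copy_from else {}
        feats.update(changes)
        result = [result_pos]
        for f, v in feats.items():
            result += [f, v]
        return [result]

    return categories, rule

spec = (
    ["CV", "NP", "=>", "VP"],
    ["CHECK:", ("FHV", "-"), "*", ("POF", "-"), ("RLF", "-")],
    ["SET:", 1, ("FOJ", "+"), ("FPH", "+")],
    ["SEMNOP"],
)
categories, rule = add_rule(spec)
cv = ["CV", "NAME", "IS", "FHV", "-"]
np = ["NP", "NAME", "JOHN"]
print(rule([cv, np]))  # [['VP', 'FHV', '-', 'FOJ', '+', 'FPH', '+']]
```

If the "*" is left out of the CHECK: list, the sketch forms a
single group of features, which is then applied to the first
part of speech only, as described above.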

Appendix 2 lists the procedures for the system. With the
procedures described above (i.e., the parser, the definition
functions and "ADDRULE") one will also note representative
procedures for the REL English syntax, such as plural,
possessive and past participle. These rules were not added
to the system with the use of "ADDRULE" but were coded
separately.

Appendix 3 lists the major portion of the REL English
syntax. Each rule is in the format for entry into the system
with the "ADDRULE" procedure.

One of the sentences used to check the system is listed
in Appendix 4. The sentence was

( IS THERE A HARVARD BOY WHOSE SISTER ATTENDS YALE ).

Having parsed the sentence, the system identified the
sentence as a subjective verb phrase (FSJ) having a singular
subject (FSI). It was also noted that the sentence is a
question (FQT) containing an intransitive verb (FVI). After
a successful parse of the sentence, the part of speech will
contain the complete diagrammed sentence as the value of
"NAME" on its description list. For the example listed in
Appendix 4, the value of "NAME" is

( IS THERE ( A (( HARVARD BOY ) WHOSE SISTER ( ATTENDS
YALE ) ) ) ).

This diagrammed sentence is in a form that the researcher
can use to readily extract pertinent information for use
in his experimentation with data representations.
V. USERS MANUAL

To facilitate use of the SSE system the user's
requirements have been kept to a minimum. Appendices 2 and 3
contain the complete listing of the system. There are three
possible inputs that the user can make to the system. He
can define words, add syntax rules, or parse a sentence.

To define a word the user would use the following format

PARSE ((
( DEF : W O R D :=type ) )).

The word "type" would be replaced by

1) NAME
2) CLASS
3) RELATION
4) VERB

depending on the type of word to be defined. It should be
noted that each symbol between "DEF" and ":=type" (i.e., :,
W, O, R, and D) is an atomic symbol.

The format to be used to add a syntactic rule to the
system was covered under "ADDRULE". Numerous examples of
that format can be found in Appendix 3.


To parse a sentence the following format would be used

PARSE ((
( sentence ) ))


In this format each character in the sentence must be
represented as an atomic symbol. For example

PARSE ((
( W H E R E  I S  J O H N ) ))
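
A small helper for preparing such input is sketched below in
Python. The name to_atoms is hypothetical, and dropping the
blanks between words is an assumption based on the earlier
example ( T H E B O Y S ); the text does not state how word
boundaries are entered.

```python
def to_atoms(sentence):
    """Split a sentence into single-character symbols, dropping blanks (assumed)."""
    return [ch for ch in sentence.upper() if not ch.isspace()]

print(to_atoms("Where is John"))
```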
VI. RECOMMENDATIONS


In developing the system to its present state, it was
found that the full implications of several of the "machine
semantic procedures" alluded to in the REL documentation
were not clear. It is therefore recommended that further
study be directed toward the development and implementation
of these semantic procedures.

During the parsing of a sentence it would be reasonable
to assume that the major portion of time spent determining
the admissibility of a syntax rule would be spent checking
features. For example there are fifteen syntax rules of the
form

( NP VP => VP )

which are used to add subjects to a verb phrase. Therefore
on the average there are seven rules that must be checked
before finding the correct rule to apply. From the authors
of the REL system it was learned that an average of six
features must be checked for each rule. Thus to find the
one syntax rule that allows the adding of a subject to a verb
phrase one must check forty-two features (7 x 6 = 42). That
is forty-two searches of a description list. By adding a
small number of bit manipulation functions to LISP, more
than one feature could be checked at a time. Features could
be implemented using the LISP capability for octal
representation, and the checking function could be performed
by a masking operation.
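
The masking operation recommended here can be illustrated
with a short Python sketch. The bit assignments in BIT and
the names pack and check are hypothetical; the actual bit
layout used by REL is not given in this report.

```python
# Hypothetical bit positions, written in octal as suggested in the text.
BIT = {"FHV": 0o1, "FOJ": 0o2, "FPH": 0o4, "FSJ": 0o10}

def pack(features_on):
    """Pack the features that are on into a single integer word."""
    word = 0
    for name in features_on:
        word |= BIT[name]
    return word

def check(word, must_be_on=(), must_be_off=()):
    """Test all required features with two masks instead of one search each."""
    on_mask, off_mask = pack(must_be_on), pack(must_be_off)
    return (word & on_mask) == on_mask and (word & off_mask) == 0

vp = pack(["FOJ", "FPH"])
print(check(vp, must_be_on=["FOJ"], must_be_off=["FHV", "FSJ"]))  # True
print(check(vp, must_be_off=["FPH"]))                             # False
```

With this representation the six features of a rule are
tested by two masking operations, rather than by six separate
searches of a description list.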


VII. CONCLUSIONS


SSE can be a valuable tool to the researcher in semantic
representations. The system is based on a comprehensive
syntax and permits the parsing of complicated English
sentences. The system returns these same sentences in a
format which makes pertinent information readily accessible
to the experimenter. Finally, because the system is written
in LISP it is transportable to any computer system offering
a LISP 1.5 compiler or interpreter.
APPENDIX 1


NP FEATURES

FEATURE      DESCRIPTION                EXAMPLE

PLF          plural                     boys
POF          possessive                 boy's
[illegible]  determiner                 the boy
QNF          quantifier                 all boys
APF          adjectival                 male parent
PMF          "of" phrase                sister of John
PSF          possessive modified        John's sister
RCF          relative clause modified   boy who lived in Boston
RLF          relation                   sister, location
INF          individual                 John, Boston
NAF          number argument            average
NVF          number valued              age, income
NAS          addition                   income + capital loss
NMD          multiplication             income * tax rate
NXP          exponentiation             income **2
ANF          animate                    John, girl


VP FEATURES

FEATURE      DESCRIPTION

FPH          phrasal, prevents further morphemic modification
FSI          singular subject only
[illegible]  past participle
[illegible]  infinitive
FPA          passive
FQT          question transformation

Examples for the six features above (only four are legible,
and their alignment with the features is not recoverable):
gives the book; arrived; was given; will John come

FEATURE      DESCRIPTION                EXAMPLE

FIN          [illegible]                John gave books
FVI          intransitive verb          sleep
FSJ          subject                    John gave books
FAG          agentive                   John gave
FOJ          objective                  books were given
FAT          time modified              arrived in June 1960
FDA          dative                     gave to Mary
FAL          instrumental               opened with a key
FGE          genitive                   received from Mary
FAL          locative                   arrived in Boston


CV FEATURES

Same as VP with the exception of FIN. Replace FIN with

FHV          "have" verb


NU FEATURES

FEATURE      DESCRIPTION                      EXAMPLE

NAS          addition or subtraction
NMD          multiplication or division
NXP          exponentiation
DTF          determiner                       the age of John





APPENDIX 2


SYSTEM LISTING


GENERAL PURPOSE PARSER

[The LISP 1.5 listing of the parser is illegible in this
copy. The legible fragments name the functions PARS1, PAR2,
ADDLIST, MAKELIS..., ONLY, SYS1 and CDAAR.]
SSE FUNCTIONS 


(SEMNOP (LAMBDA (X) 
* MAKES AN ATOM FROM A GIVEN LIST 


NIL)) 


(MAKEATO 


(RETURN (MKATOM)))) 


* CHECKS THEwEEAT UREN Sas OF Sarak! OF sSPeeen 


= 1) 


CADAR TEST) Y)) CRETURN NIL))) 
EST) 


Bee] UU 
m~ 
~ om © 
> a= 
mY © 
b— aml ~ 
Y) a oe 
WW > ad 
aad <7 
~ rat @ 
1 9 TS oe 
9 tO WW <r 
© Us WY 
(o Cre ll me al a omasll © 
Ot ~x«M am) 
om iL. om} — <I 
eiyN Zz D =~ 
We NO Oy, 
ky umn SS w= 
Se mee ~~ UO 
<b > oe | bt ©) 
BO 212 =>e Je] 
Ie2De-F2C02Dki- 
Owo VZwowo 2zwoO 
DIO ~NO~O~Y M0 
SU eet ee eet es tee es ee Me 
<I 
—J 
— =O 
O 
Ye 
a =) 
cs 
t=“ 
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* CHANGES THE VALUESS OF “=EAIURESeON IE fealvRe or ot 


GCG AGES!) 


UOTE T)))) 
)) 


LF cd 


CUASS 


TONSSARE USED 70 DEFINE VAMNAME., 


))) 







)) 


) 


(RETURN Y) 


(GO SKIP))) 


E}) 
QUOTE N1) Z))) ))) 


M 
( 


<I 



(LA 


(REL 


) 
(GO SKIP)))


Ey) 
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wet tat 

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CoO 


SKIP 




(VRB (LA 


(GO SKIP))) 





(RETURN (LIST (CONS (QUOTE V1) Z))) ))) 




OTE NP) (CDR (STUFF (MAKEATOM X) Z))))


+)))) 
NP) (CDR (STUFF (MAKEATOM X) Z)))) 


TE VP) (CDR (STUFF (MAKEATOM X) Z))))


))) 


QUO 
)) 
* ADDS NEW SYNTAX RULES TO EXISTING RULES


Sul 


FN ARGS1 RTN FEA1 FEA2 FEA3 NUM FEAT


(ADDRULE (LAMBDA (


* GENERATES A UNIQUE FUNCTION NAME 


"MRULES" 
UOTE RULES)) (LIST (CONS ARGS1 FN)))


* ADDS THE NEW PARSING RULE TO 


Q 
) 


wwe Y) 


(GO “80202 


(QUOTE CHECK:))) 


UAL (CAR ARGS) 





* COLLECTS THE FEATURES THAT ...


)) 
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* THE GENERAL FUNCTION 





(QUOTE FEAT


(GOSS Tee) )} 
(RETURN NIL))) 


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(MAKEATOM M)))) 


(PUT RTNV (QUOTE NAME) 


))) 


UE (NUE C eis) 
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VALUES FOR THE DUMMY VARIABLES 


c 


* SUBST CUP Se ihe wii 




* MAKE THE NEW FUNCTION A PART OF THE EXISTING FUNCTIONS


* LIST OF RULES FOR Viiew ran sen 




* NEW PARSING RULES WILL 


))) 


TU TENO) RULE ea I 


Se OONGl USE 


le 
x 
(PROG (FEASEEAT Y) 
* THESE ARE THE FEATURES WHICH MUST BE ...


(PLURAL (LAMBDA (X) 


TE -)(PLE -)(PMF -) (POF -) (PSE -)


Q-— 


* THIS IS THE FEATURE WHICH WILL BE SET IF THE RULE APPLIES




NP 
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